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Abstract — Base stations with a large number of transmit 
antennas have the potential to serve a large number of users 
simultaneously at higher rates. They also promise a lower power 
consumption due to coherent combining at the receiver. However, 
the receiver processing in the uplink relies on the channel 
estimates which are known to suffer from pilot interference. 
In this work, we perform an uplink large system analysis of 
multi-cell multi-antenna system when the receiver employs a 
matched filtering with a pilot contaminated estimate. We find the 
asymptotic Signal to Interference plus Noise Ratio (SINR) as the 
number of antennas and number of users per base station grow 
large while maintaining a fixed ratio. To do this, we make use 
of the similarity of the uplink received signal in a multi-antenna 
system to the representation of the received signal in CDMA 
systems. The asymptotic SINR expression explicitly captures the 
effect of pilot contamination and that of interference averaging. 
This also explains the SINR performance of receiver processing 
schemes at different regimes such as instances when the number 
of antennas are comparable to number of users as well as when 
antennas exceed greatly the number of users. Finally, we also 
propose that the adaptive MMSE symbol detection scheme, which 
does not require the explicit channel knowledge, can be employed 
for cellular systems with large number of antennas. 

I. Introduction 

Cellular systems with large number of base station antennas 
have been found to be advantageous in mitigating the fading 
effects of the channel OJ . In the downlink, dense low-powered 
base stations operating with power in the order of milliwatts 
have the potential to conserve power as compared to the 
present systems. In the uplink, coherent receiver processing 
with a large number of antennas reduces the transmitted 
powers of the users. It is shown in j2) that in an infinite 
antenna regime, and in a bandwidth of 20 MHz, a time division 
duplexing system has the potential to serve 40 single antenna 
users with an average throughput of 17 Mbps per user. 

However, any advantages offered by multiple transmitters 
at the base station can be utilized only by gaining the channel 
knowledge between the base station and all the users. This 
requires training data to be sent from the users. Hence, a 
part of the channel coherence time is utilized for gaining the 
channel knowledge between the base station and all users. 
In a typical system the time-frequency resources are divided 
into blocks of coherence-time coherence-bandwidth product, 
where some resources(time or frequency) are used for channel 
estimation and the rest is used for transmission in uplink or 
downlink. However, in (Tl, it is shown that the number of 
pilot symbols required is proportional to the total number of 
users in the system. Hence, as the system scales with the 



number of users, the dedicated training symbols may take up 
the coherence time of the channel. As this is undesirable, only 
a part of the coherence time is utilized to learn the channel. As 
a result, the pilot sequences in different cells overlap over time- 
frequency resource and as a consequence the channel estimate 
is corrupted. This is called pilot interference which is found 
to be a limiting factor as we increase the number of antennas 

It is shown in Q that in the limit of infinite number of 
antennas, the SINR using a matched filter receiver is limited 
by interference power due to pilot contamination. While the 
result assumes a regime with finite number of users, we can 
also envision a regime where the number of users may be 
comparable to the number of antennas such as a system with 
50 antenna base stations serving 50 users simultaneously. In 
this work, we do a large system analysis of uplink multi-cell, 
multi-antenna system when the receiver employs a matched 
filter to decode the received signal. We let the number of 
antennas and the number of users per base station grow large 
simultaneously while maintaining a fixed users to antennas 
ratio and observe the SINR in the following cases, 

1. when there is a perfect channel estimate, 

2. when we have a pilot corrupted channel estimate. 

In order to accomplish that we make use of the similarity of 
the uplink received signal in a MIMO system to that of the 
received signal in a CDMA system (7). Further we compare 
the results of the asymptotic SINR expression so obtained. 

Also, we propose an adaptive filtering method reminiscent 
of CDMA systems where the uplink receiver filter at the 
base station converges to the desired MMSE filter. While 
MMSE filtering requires us to estimate the channel to all the 
users in the system, the adaptive MMSE does not require any 
prior estimation of the channel to any of the users. Using 
independent training symbols among users, the receive filter 
will converge to the desired MMSE filter. 

II. System Model 

We consider a system similar to that in (2) with B base 
stations and K users per base station. We assume that all 
the KB users in the system are allocated the same time- 
frequency resource. Also, each base station is equipped with 
M antennas. The channel vector representing the small scale 
fading between user k in cell j and the antennas in base 
station / is given by a M x 1 vector hv^. The entries of 
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h^2 are assumed to be independent zero mean i.i.d Gaussian 
random variables with unit variance. This corresponds to an 
ideal and favourable propagation medium with rich scattering. 
The large scale fading coefficients, which represents the power 
attenuation due to distance and effects of shadowing between 
base station I and k th user in j th cell is given by p^. This is 
constant across the antennas of the c ell I. Accordingly, overall 

channel vector is given by gj2 = y Pfu^jk- 

A. Uplink Transmission 

We assume that all user's transmission are perfectly syn- 
chronized. Also, while a user's transmission is intended to its 
base station, other base stations also hear the transmission. 
Defining qjf. as the symbol transmitted by user k in cell j, 
w^) as the M x 1 noise vector with zero mean unit variance 
gaussian entries, p r /M as the uplink signal to noise ratio 
scaled by the number of antennas, the received signal at base 
station I is given by, 

3=1 k=l 

Since processing power is not an issue at the base station, 
received signal processing can be employed. However, the base 
station has to have an estimate of the channel to all users prior 
to transmission of uplink information. In a system employing 
OFDM physical layer with time-frequency resources, we can 
divide the resources into blocks spanning the coherence-time 
coherence-bandwidth product. Although the channel vector 
hj2 to each of the users has to be relearned by the base 
station at the start of every coherence time, once learnt for a 
subcarrier it remains the same for of all subcarriers spanning 
the coherence bandwidth during that coherence time. Let the 
number of coherent symbols be given by T c and the coherent 
subcarriers be N c . Therefore, if we fix the number of symbols 
used for estimation to be T such that T < T c , a total of N C T 
user's channel can be learnt. This observation was noted in \2\. 
We would like to point out that it is relevant here as the number 
of users than can be supported depends on N c and depending 
on its value the number of users K that could be supported can 
be comparable to M. Therefore, it is worthwhile to investigate 
not only M K scenario but also the case when M and K 
large and comparable. 

B. Limitations in gaining Channel Knowledge 

During each coherence time, users in a cell spend some 
pilot symbol times over the subcarriers spanning the coherence 
bandwidth for channel estimation at the base station and then 
the transmission of data ensues until the end of the coherence 
time. At base station /, the number of channel vectors that 
needs to be learnt is equal to the number of users in the system 
which is KB where K = N C T. In order to accomplish that, 
the number of pilots required must at least be KB symbol 
times. However, such a system will not be scalable as there 
exists some large B for which the product KB will occupy all 
the coherence time. This is clearly undesirable as pilot training 
is taking up significant part of a coherence time. 



C. Pilot Interference based Channel Estimate 

In one of the approaches taken in j2|, the base station 
is concerned with only knowing the channel to its own K 
users and spends only K time-frequency resources for channel 
estimation instead of KB. Every base station similarly spends 
first K time-frequency resources for channel estimation to 
its K users. Intuitively, we can think of K users of the 
I th base transmitting pilots in K orthogonal times-frequency 
resource to its base. Consequently, the user indexed one will 
be transmitting pilot symbol to its base station I in the first 
time slot in a subcarrier. Since the first user in other cells 
also transmits at the same time and at the same subcarrier, the 
received signal is corrupted by other pilots transmission. 

This results in an estimate as given below, 
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The variable k which can take values or 1 is in order to 
distinguish a perfect estimate and a pilot corrupted estimate 
respectively. Since our aim is to analyse the effect of pilot 
interference we assume that there is no uncorrelated gaussian 
noise further corrupting the estimate. It has been shown in (2j, 
[ 3 1 that using this estimate and a matched filter decoder the 
achievable rates are limited by inter-pilot interference as the 
number of antenna grows. 

III. Large System Analysis 

Since the SINR analysis is identical to all users in the system 
we focus only on user 1 in base station indexed 1. For matched 
filtering we project the received signal in equation ([!]) 
onto the normalized channel estimate g^/v^M. We also drop 
the superscript (.)^' for notational convenience. Consequently, 
the SINR of the first user to its base station with pilot corrupted 
channel estimate is given by, 
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Theorem 1. Let SINR obtained with matched filtering be as 
given in equation (jij, then as M,K — > oo, with K/M = a, 
SINR converges in probability to 



SINR* = 



Pr i+«(Ef=jfti)/fti 



1 



Pr 



Ba 



(4) 



'i+«(Ef =2 fti)//3ii 

where, the expectation is over the distribution of ft - the 
random variable representing the realizations of large scale 
fading gains /3' jk s. 

Proof: Proof given in Appendix ■ 
The expression for the asymptotic SINR captures the ef- 
fect of both interference due to pilot contamination and the 
interference averaging due to a ^ 0. The SINR expression in 
the limit of infinite antennas but finite number of users per 
cell are obtained when we put a = and this corresponds to 



the expression for SINR in Q, (3). It is seen that the SINR 
expression so obtained is limited by the pilot interference 
powers. On the other hand with a perfect channel estimate, 
Elk = Sxfc implying that k = we get the SINR of matched 
filter as in |7). We can view Theorem [T] as a generalization of 
the large system analysis for a matched filter receiver with a 
pilot contaminated channel estimate. 

IV. Numerical Evaluation 

For the numerical evaluation, we consider hexagonal cells 
with users uniformly distributed in each of the cells as shown 
in Fig. [3] We consider a set up where 6 closest cells are 
interfering with the center cell. We assume received powers 
from all the users are unity i.e (3jk — 1. We consider the SINR 
for the user one in the center cell. Fig. [T] plots the asymptotic 
SINR of the matched filter with a perfect estimate and that of 
a matched filter with a pilot corrupted estimate for the case of 
equal received powers. It is seen that using a pilot interference 
based estimate for matched filtering causes at least a 10 dB 
loss in the SINR. This is dependent on the users contributing 
to pilot interference based estimate and hence dependent on 
B. Fig. [2] shows that the random realizations of the SINR for 
different values of a are clustered around the asymptotic plots. 
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Fig. 1. The asymptotic limit of matched filter SINR with a perfect estimate 
is plotted with that of matched filter with a pilot corrupted estimate. The 
number of antennas considered is 50 and the signal to noise ratio is 20 dB. 

In order to get more intuition under practical scenarios of 
large scale fading gains, we consider the seven cell model with 
cell radius is R ', = 2 km, and assume a COST231 model for 
propagation loss between the base station and the users. The 
noise power is assumed to be —174 dBm and user transmit 
power of 23 dBm. We plot the CDF of the SINR in Fig. [4] for 
the following scenarios, 

• a = with k = corresponding to matched filtering with 
perfect estimate with number of antennas far exceeding 
the number of user. 

• a = and k = 1 corresponding to case when we employ 
matched filtering at the receiver with a pilot corrupted 
channel estimate similar to (2), 
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Fig. 2. We observe that the random realizations of the SINRs are clustered 
around the asymptotic limit. 
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Fig. 3. In the favourable case, the sum of the received powers of interferers 
contributing to pilot contamination are very less as compared to that of the 
desired user. User 1 in the center cell represents such a scenario. The SINR 
with a pilot corrupted estimate is then comparable to that of perfect estimate. 
On the other hand for user 2 in the center cell, the pilot interferers received 
powers are comparable to that of the desired user and represents the worst 
case scenarios. 



• a = 1 with 50 antennas and 50 users per base station and 
k = 0, representing the case of matched filter receiver 
with a perfect channel estimate, 

• a = 1 with 50 antennas and 50 users per base station and 
k — 1 as representing the case of matched filtering with 
a pilot corrupted channel estimate but with comparable 
number of antennas and users. 

Again, we only consider evaluating the SINR of the first user 
at the first base station as the SINR of all other users in the 
system follow the same statistics. As compared to the case 
when a = there is at least 25 dB loss in SINR when the 
number of antenna becomes equal to number of users. In the 
case of matched filtering with perfect estimate this can be 
explained entirely by the non-zero interference power because 
a = 1. When matched filtering with a pilot corrupted channel 
estimate is employed, the shift is due to the combination of 



both pilot interference power and that interference due to a 
1. More specifically consider a scenario when 
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This corresponds to the fact that sum of received powers of the 
interferers are much less that that of desired user power. Under 
these conditions the SINR of the received signal in equation 
(ffl is, 
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and hence the SINR of the matched filter with the corrupt 
channel estimate is as good as the SINR with a perfect channel 
estimate. This is seen in the best case SINR realizations in Fig. 
|4] In Fig. [3] the situation of user 1 in the center cell represents 
the favourable scenario with the interferers contributing to 
the pilot contamination are far such that the condition |5]) is 
satisfied. 

On the other hand if all the interferers gains are comparable 
to that of the desired users gains represented by 
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then pilot interference contributes negatively to the SINR in 
addition to interference averaging. This is seen in Fig. [4] for 
the worst case SINR realizations. In Fig. [3] the situation of 
desired user 2 represents such a scenario with desired user 2 
at the edge of the center cell and the interferers contributing in 
the pilot estimate as shown in the figure. These observations 
are true for all values of a. 
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Fig. 4. The figure shows the CDF of SINR for five different system situations 
is used to compare the effect of interference when the number of antennas 
and users are comparable. We can see that MMSE performs much better than 
matched filter with a corrupt estimate. 



V. Adaptive MMSE Filtering 

We propose that adaptive MMSE filtering can be used for 
each user to decode its uplink data. Adaptive MMSE does not 
require the explicit knowledge of the channel. Instead, pilot 



training sequences known at the base station are sent by every 
user simultaneously so that each user converges to its desired 
MMSE filter directly. The explanation for the implementation 
of adaptive MMSE is as follows. Given the received signal in 
equation ([T}, we wish to design an MMSE filter en for the 
first user in the first cell so that E[|gxi — cf^y | ] is minimized. 
From J8], the filter which minimizes mean square objective 
is given by en = (E[yy- tf ])~ 1 E[<7 1 , 1 y]. In order to form 
the filter en directly, the channel knowledge gjfc from the 
first base to all the users is required. A way to overcome this 
difficulty is by using stochastic gradient descent algorithm, 
so that the filter adaptively converges to MMSE as proposed 
for CDMA systems (§. With < t < T being the index of 
time-frequency resources spent for training data, {^n[t]} the 
T length training sequence for user indexed one and /i > 0, 
the progressively decreasing step size, a step in the algorithm 
is given by the equation, 

c u [t] = c u [t-l]-/*(V'ii[*]-cf 1 [t-l]y[t])y[*]. (7) 

It has been shown that there exist positive values of fi so that 
the iteration (|7]i converges to the MMSE filter. Also, as long 
as training data satisfies E[^fe^i*J = for {j, k} ^ the 
filter converges to the MMSE of user 1 in cell 1. Consequently, 
the SINR of the MMSE estimate is given by, 
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In Fig.|4]we plot the SINR of the MMSE filter assuming that 
the algorithm has converged to cu. As compared to the SINR 
with matched filtering, when a = 1, it is seen that MMSE 
can provide a gain of at least 10 dB because of interference 
cancelling capabilities. 

VI. Conclusion 

In this work we consider a multi-cell multi-antenna system 
with large number of antennas and users. We perform a large 
system analysis and derive the asymptotic SINR (equation Q) 
when a matched filter with a pilot contaminated estimate is 
employed at the receiver. We verified this expression numeri- 
cally by seeing that the random realizations of SINR converge 
to the asymptotic limit as we increase the number of antenna. 
We also showed that when the number of users is comparable 
to the number of antennas the performance of the matched 
filter with corrupt estimate is limited by the interference. This 
is in contrast with the case when number of antennas far 
exceed the number of users, where the pilot interference term 
has the effect of only reducing the 5 percentile SINR. We 
also proposed an adaptive MMSE symbol detection scheme 
where there is no necessity to estimate the channel. This could 
help in mitigating the effect of pilot contamination in the 
uplink. However, since the training symbols are limited due 
to coherence time of the channel, convergence to the MMSE 
filter will be an issue and this is a subject of current research. 



Appendix A 
Proof of TheoremQ] 

In equation Q numerator term can be simplified as, 
gggn _ (gfi + %AEf= 2 gji) ff gii 
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From (10 1, we observe that, liniM_ i . 00 SnSn/M = /?n. In 
equation Q, the first term in the denominator which is the 
noise term given by gngn/M is equal to, 
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From (12 1, we observe that, liniM^oo gfign/Af = fin + 



kX)j=i ftji- Converting the subscript of the channel vectors 
which is double indexed into subscript which is single indexed 
by using the definition g jk = ^%kYi jk = v //3^h( J -_ 1 )i f+fe = 
g(j-i)K+k, tne interference term in equation <j3j can be 
simplified as, 
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Using to denote convergence in probability, from ^ 
Proposition 3.3] we can show that in the limit of M — > oo, 
K -t oo, with K/M = a, 
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where, in equation ( fl4] > the first term converges due to (7] 
Proposition 3.3] and the second term converges due to weak 
law of large numbers. Assuming a ^ b, the other terms are of 
the form, 

BK 

pr h T * 

Sa gigfe gi 



M 2 ^ 

i=2 
_ Pr^ 

~ M 2 



i^l,a,b 
Pr n 



H T * 
g a gigfc gi 



|2 T * , Pr 
I Sb Sa + 



|2„B„ 
I ga gb 



(15) 



In a similar approach to analysis of the interference term in 
(7] Proposition 3.3] we can show that in equation ( p~5] > the first 
term converges to its expectation which is and the second 
and third term converges to by weak law of large numbers. 
Since, each of the above terms converge in probability, the sum 
of the terms also converge in probability. After rearranging the 
terms we get the expression of SINR as in Theorem [T] 
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